ABSTRACT

MRI brain images play a vital role in helping radiologists assess patients for diagnosis and treatment.
Manual study of medical images by a radiologist is tedious and time consuming, and its accuracy depends
on the radiologist's experience. Computer aided systems are therefore necessary to overcome these limitations.
Although several automated methods are available, segmentation of MRI brain images remains a challenging problem
because of its complexity, and no standard algorithm can produce satisfactory results. This paper reviews
current automated brain image segmentation methodologies that are accurate and require little user
interaction, and discusses their advantages and disadvantages. The review is intended to guide the combination
of two or more methods to produce accurate results.
INTRODUCTION

Magnetic resonance imaging (MRI) of the brain is a safe and painless test that uses a magnetic field and radio
waves to produce detailed images of the brain and brain stem. MRI differs from computed tomography (CT)
in that it does not use ionizing radiation. MRI can detect a variety of conditions of the brain such as cysts,
tumours, bleeding, swelling, structural abnormalities, infections and problems with the blood vessels. It is
useful in evaluating problems such as persistent headaches, dizziness, weakness and seizures, and it can help to
detect certain chronic diseases of the nervous system such as multiple sclerosis. In some cases, MRI can provide
clear images of parts of the brain that cannot be seen with an X-ray, CT scan or ultrasound.

There are many different types of paediatric brain tumours, ranging from those that can be cured with minimal
therapy to those that cannot be cured even with aggressive therapy. Some of the common types are astrocytomas,
ependymomas, brainstem gliomas, germ cell tumours and craniopharyngiomas. Segmentation of the brain into tissues
such as gray matter, white matter, cerebrospinal fluid, skull and tumour is very important for detecting tumour,
edema and hematoma. Research in developed countries has shown that the death rate of people affected by brain
tumours has increased over the past three decades [39]. A tumour is a mass of tissue that grows out of control of
the normal forces that regulate growth [50]. Tumours can directly destroy healthy brain cells. They can also
indirectly damage healthy cells by crowding other parts of the brain and causing inflammation, brain swelling and
pressure within the skull [28]. Early research on tumour detection directly applied classic image processing
methods (such as edge detection and region growing) based on the gray intensities of images. In recent years,
supervised techniques such as k-nearest neighbour, artificial neural networks and support vector machines (SVM),
and unsupervised techniques such as the self-organizing map (SOM) and the fuzzy C-means algorithm, have been
used to classify T2-weighted MRI brain images as normal or pathological.


TRANS 

STELLAR 

•Journal  Publications  •  Research  Consultancy 




D.  Selvaraj  &  R.  Dhanasekaran 


Even though many algorithms are available for detecting brain tumours, the detection rate is still not satisfactory.
In this paper, various approaches to MRI brain image segmentation are discussed in section 2, and feature
extraction methods are reviewed in section 3. Section 4 discusses the classifiers used for classifying brain images,
and section 5 concludes with the methods found suitable for segmentation and classification.

MRI  BRAIN  TISSUE  SEGMENTATION  TECHNIQUES 

Various segmentation methods have been reported in the literature. Some of the commonly used methods are
summarized in Figure 1.

Figure 1: MRI Brain Image Segmentation Methods


Thresholding 


Thresholding is one of the most frequently used methods for image segmentation. It is effective when the regions
of interest differ clearly in intensity. The image is partitioned directly into regions based on the intensity
values. Thresholding is defined mathematically [62] as Eq. (1). Let f(x,y) be the input image and T be the
threshold value; then the segmented image g(x,y) is given by

$$ g(x,y) = \begin{cases} 1, & \text{if } f(x,y) > T \\ 0, & \text{if } f(x,y) \le T \end{cases} \tag{1} $$

Using Eq. (1), the image can be segmented into two groups. To segment the image into more groups, several
threshold values are needed.

With two threshold values T_1 < T_2, the equation becomes Eq. (2), which segments the image into three groups:

$$ g(x,y) = \begin{cases} a, & \text{if } f(x,y) > T_2 \\ b, & \text{if } T_1 < f(x,y) \le T_2 \\ c, & \text{if } f(x,y) \le T_1 \end{cases} \tag{2} $$
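
As an illustration of Eq. (1) and Eq. (2), the following minimal sketch applies a single threshold and a pair of thresholds to a small array. The function names, the sample intensities and the integer labels standing in for a, b and c are hypothetical, and the choice to place pixels that equal a threshold in the lower group is an assumption.

```python
import numpy as np

def threshold_binary(f, T):
    """Eq. (1): 1 where f(x, y) > T, otherwise 0."""
    return (np.asarray(f) > T).astype(np.uint8)

def threshold_multi(f, thresholds, labels):
    """Eq. (2), generalised: labels[k] is assigned to the k-th intensity band.

    thresholds must be sorted in ascending order and
    len(labels) must equal len(thresholds) + 1.
    """
    f = np.asarray(f)
    # right=True sends pixels equal to a threshold to the lower band (assumption)
    bands = np.digitize(f, thresholds, right=True)
    return np.asarray(labels)[bands]

# Hypothetical 2x3 image of gray levels
image = np.array([[10, 80, 200],
                  [40, 120, 250]])
binary = threshold_binary(image, T=100)
# labels listed from the darkest band (c) to the brightest band (a)
three_groups = threshold_multi(image, thresholds=[50, 150], labels=[0, 1, 2])
```

Here label 0 plays the role of c, label 1 of b and label 2 of a in Eq. (2).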


D. Bhattacharyya and Tai-hoon Kim [8] have proposed an image segmentation technique to identify the tumour
in brain magnetic resonance images. Existing thresholding techniques produce different results on each image.
To obtain satisfactory results on brain tumour images, they therefore proposed a technique in which the
tumour is detected uniquely. Surveys of the available thresholding techniques are provided in [14, 54,
60, 74].

Otsu's Thresholding

Otsu's method is a commonly used thresholding technique. It is based on discriminant analysis and partitions
the image into two classes C_1 and C_2 at gray level k, such that C_1 = {0, 1, 2, ..., k} and
C_2 = {k+1, k+2, ..., L-1}, where L is the total number of gray levels of the image. Let n be the total number of
pixels in the given image and n_i be the number of pixels at the ith gray level. The probability of occurrence of
gray level i is defined as

$$ p_i = \frac{n_i}{n} \tag{3} $$

C_1 and C_2 are two classes representing the region of interest and the background.
The probabilities of classes C_1 and C_2 are

$$ P_1(k) = \sum_{i=0}^{k} p_i \tag{4} $$

$$ P_2(k) = \sum_{i=k+1}^{L-1} p_i = 1 - P_1(k) \tag{5} $$

The mean intensity values of these two classes are

$$ m_1(k) = \frac{1}{P_1(k)} \sum_{i=0}^{k} i \, p_i \tag{6} $$

$$ m_2(k) = \frac{1}{P_2(k)} \sum_{i=k+1}^{L-1} i \, p_i \tag{7} $$

where m_1(k) and m_2(k) are the mean gray levels of the object and of the background, respectively.
Disadvantages: 

•  Otsu  method  can  segment  only  larger  objects  from  background 

•  Otsu  method  fails  if  the  histogram  is  unimodal  or  close  to  unimodal 
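
The quantities in Eqs. (3) to (7) can be combined into a threshold search, sketched below. The selection rule used here, picking the k that maximises the between-class variance P_1(k) P_2(k) (m_1(k) - m_2(k))^2, is the usual Otsu criterion but is not stated in the text above, so it should be read as an assumption. The function name and the test image are hypothetical, and pixel values are assumed to be integers in 0 ... L-1.

```python
import numpy as np

def otsu_threshold(image, L=256):
    """Return the gray level k that maximises the between-class variance."""
    hist = np.bincount(np.asarray(image).ravel(), minlength=L).astype(float)
    p = hist / hist.sum()                       # Eq. (3): p_i = n_i / n
    levels = np.arange(L)
    P1 = np.cumsum(p)                           # Eq. (4)
    P2 = 1.0 - P1                               # Eq. (5)
    cum_mean = np.cumsum(levels * p)
    total_mean = cum_mean[-1]
    valid = (P1 > 1e-12) & (P2 > 1e-12)         # skip splits with an empty class
    m1 = np.zeros(L)
    m2 = np.zeros(L)
    m1[valid] = cum_mean[valid] / P1[valid]                   # Eq. (6)
    m2[valid] = (total_mean - cum_mean[valid]) / P2[valid]    # Eq. (7)
    # Between-class variance (assumed selection criterion)
    sigma_b = np.where(valid, P1 * P2 * (m1 - m2) ** 2, -1.0)
    return int(np.argmax(sigma_b))

# Hypothetical bimodal image: half the pixels at 50, half at 200
image = np.array([[50, 50, 200, 200],
                  [50, 200, 50, 200]], dtype=np.uint8)
k = otsu_threshold(image)
segmented = (image > k).astype(np.uint8)
```

For a unimodal histogram the variance curve has no pronounced peak, which is consistent with the second disadvantage listed above.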
Local  Thresholding 

Threshold values are selected locally by dividing the image into sub-images and calculating a threshold value for
each part. Local thresholding takes more computation time than global thresholding. It gives satisfactory results
when the background varies across the image, but it can extract only small regions [46].

Histogram  Thresholding 

Histogram thresholding segmentation is based on thresholding of histogram features and gray level
thresholding. The threshold is defined mathematically as in Eq. (1).

Algorithm [56, 62].

Step 1: The MRI brain image is divided into two equal halves around its central axis and the histogram of each
half is drawn.

Step 2: The threshold point of the histogram is calculated by comparing the two histograms.

Step 3: Segmentation is done using the threshold point for both halves.

Step 4: The detected image is cropped along its contour to find the physical dimensions of the tumour.

Step 5: Create an image of the original size and check each pixel value of the segmented image. If the value is
greater than the threshold value, assign 255; otherwise assign 0.

Step 6: Segment the tumour area.

Step 7: Calculate the area of the tumour.
Edge  Based  Segmentation 

Edge based segmentation methods partition an image based on rapid changes in intensity near edges [23, 48]. The
result is a binary image. In theory there are two main edge based segmentation methods: the gray histogram based
method and the gradient based method [32]. The result of an edge detection technique depends mainly on the
selection of the threshold T [32]. Common edge detection operators used in the gradient based method include the
Sobel operator, the Canny operator, the Laplace operator and the Laplacian of Gaussian (LoG) operator. Canny is
the most promising [23], but it takes more time than the Sobel operator.

Edge detection methods require a balance between detection accuracy and noise immunity. If the detection
sensitivity is too high, noise may introduce false edges that distort the outline of the image; if the degree
of noise immunity is too high [32], some parts of the image outline may go undetected and the position of objects
may be mistaken. Thus, edge detection algorithms are suitable for images that are simple and noise-free, and they
often produce missing or extra edges on complex and noisy images [69].
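
The dependence of a gradient based detector on the threshold T can be seen in the following minimal sketch of Sobel edge detection. It is an illustration only: the helper names, the edge-replicated border handling and the test image are assumptions, and no smoothing step is included.

```python
import numpy as np

SOBEL_X = np.array([[-1, 0, 1],
                    [-2, 0, 2],
                    [-1, 0, 1]], dtype=float)
SOBEL_Y = SOBEL_X.T

def filter3x3(image, kernel):
    """Correlate a 2-D image with a 3x3 kernel, replicating border pixels."""
    image = np.asarray(image, dtype=float)
    h, w = image.shape
    padded = np.pad(image, 1, mode="edge")
    out = np.zeros((h, w))
    for dy in range(3):
        for dx in range(3):
            out += kernel[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out

def sobel_edges(image, T):
    """Binary edge map: 1 where the gradient magnitude exceeds T."""
    gx = filter3x3(image, SOBEL_X)
    gy = filter3x3(image, SOBEL_Y)
    return (np.hypot(gx, gy) > T).astype(np.uint8)

# Hypothetical image with a vertical step edge between columns 2 and 3
image = np.zeros((4, 6))
image[:, 3:] = 100
edges = sobel_edges(image, T=200)
```

Lowering T makes the detector respond to weaker intensity changes, including noise; raising it suppresses noise but can drop genuine edges.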

Region  Based  Segmentation 

Compared with edge detection methods, region based segmentation algorithms are relatively simple and more
immune to noise [32, 79]. Edge based methods partition an image based on rapid changes in intensity near edges,
whereas region based methods partition an image into regions that are similar according to a set of predefined
criteria [23, 31]. Region based segmentation algorithms mainly include the following methods.

Region  Growing 

Region growing is one of the popular segmentation methods. It starts with a seed pixel and grows the region by
adding neighbouring pixels that satisfy a threshold based similarity criterion. When the growth of a region stops,
another seed pixel that does not belong to any region is chosen and the process is repeated [42]. Region growing
stops when every pixel belongs to some region.

Region growing segmentation is used particularly for delineating small, simple structures such as tumours and
lesions [20, 57, 59]. Its limitations are: 1. manual interaction is sometimes required to select the seed point;
2. it is sensitive to noise, so the extracted regions may contain holes or be over-segmented. Discontinuities in
the extracted region can be removed by using a homotopic region growing algorithm [9].
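
A single region can be grown from one seed as in the sketch below. The similarity test (absolute difference from the seed intensity of at most T) and the use of 4-connectivity are assumptions chosen for illustration; the text above does not fix either, and the function name and sample image are hypothetical.

```python
from collections import deque
import numpy as np

def region_grow(image, seed, T):
    """Grow a 4-connected region from `seed` (row, col).

    A neighbouring pixel joins the region when its intensity differs
    from the seed intensity by at most T (assumed criterion).
    """
    image = np.asarray(image, dtype=float)
    h, w = image.shape
    mask = np.zeros((h, w), dtype=bool)
    mask[seed] = True
    seed_value = image[seed]
    queue = deque([seed])
    while queue:
        y, x = queue.popleft()
        for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
            if 0 <= ny < h and 0 <= nx < w and not mask[ny, nx]:
                if abs(image[ny, nx] - seed_value) <= T:
                    mask[ny, nx] = True
                    queue.append((ny, nx))
    return mask

# Hypothetical image: a dark 2x2 block in the top-left corner
image = np.array([[10, 12, 90],
                  [11, 13, 95],
                  [80, 85, 92]])
region = region_grow(image, seed=(0, 0), T=5)
```

Repeating the call with a new seed taken from the pixels not yet covered gives the multi-region procedure described above.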

Region  Splitting  and  Merging 

In this method, the image is split into regions according to some criterion and the regions are then merged. The
whole image is initially taken as a single region and its internal similarity is computed using the standard
deviation. If the variation is too high, the region is split into sub-regions using thresholding. This is repeated
until no further splits are possible. A quadtree is a common data structure used for splitting. Then comes the
merging phase, where two regions are merged if they are adjacent and similar. Similarity can be measured by
comparing the mean gray levels or by using statistical tests. Two regions R_1 and R_2 are merged into R_3 if

$$ H(R_1 \cup R_2) = \text{True} \tag{8} $$

and

$$ |m_1 - m_2| < T \tag{9} $$

where m_1 and m_2 are the mean gray level values in the regions R_1 and R_2, and T is an appropriate threshold [23].
Merging is repeated until no further merging is possible. The major advantage of this technique is that it
guarantees connected regions. Its drawbacks are: 1. the results depend on the position and orientation of the
image; 2. regular division leads to over-segmentation during splitting. The second drawback can be overcome by
reducing the number of regions with the normalized cuts method. An improved quadtree method for split and merge
is introduced in [19]; it uses three steps: splitting the image, initializing the neighbours list, and merging
the split regions. The literature reports that split and merge is computationally fast; its drawback is a lack
of sensitivity to image semantics [20, 73].
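
The splitting phase and the mean-based merge test of Eq. (9) can be sketched as follows. This is a simplified illustration, not the improved method of [19]: the names `quadtree_split` and `similar`, the parameters `max_std` and `min_size`, and the sample image are hypothetical, and the adjacency check required before merging is left out.

```python
import numpy as np

def quadtree_split(image, max_std, min_size=1):
    """Return blocks (y, x, height, width) whose standard deviation
    is at most max_std, or that cannot be split further."""
    image = np.asarray(image, dtype=float)
    blocks = []

    def split(y, x, h, w):
        block = image[y:y + h, x:x + w]
        if block.std() <= max_std or h <= min_size or w <= min_size:
            blocks.append((y, x, h, w))
            return
        h2, w2 = h // 2, w // 2
        split(y, x, h2, w2)
        split(y, x + w2, h2, w - w2)
        split(y + h2, x, h - h2, w2)
        split(y + h2, x + w2, h - h2, w - w2)

    split(0, 0, *image.shape)
    return blocks

def similar(image, a, b, T):
    """Eq. (9): True when the mean gray levels of blocks a and b differ by less than T."""
    image = np.asarray(image, dtype=float)
    mean_a = image[a[0]:a[0] + a[2], a[1]:a[1] + a[3]].mean()
    mean_b = image[b[0]:b[0] + b[2], b[1]:b[1] + b[3]].mean()
    return abs(mean_a - mean_b) < T

# Hypothetical 4x4 image with a bright 2x2 block in the top-left corner
image = np.zeros((4, 4))
image[:2, :2] = 100
blocks = quadtree_split(image, max_std=5)
```

In this example the three dark blocks pass the merge test with one another while the bright block does not.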

Watershed  Segmentation 

The watershed segmentation algorithm can be used if the foreground and the background of the image can be
identified. It is also used to capture weak edges. Selection of the seed point is the main drawback of this
approach: random selection of seed points may lead to inappropriate results and slows convergence. Watershed
segmentation is therefore not well suited to MRI brain images.

Snakes 

Active contours, or snakes, are computer generated curves [29] that move within the image to find object
boundaries under the influence of internal and external forces. The internal forces are responsible for smoothness,
while the external forces guide the contour towards the boundary of the region of interest (ROI). The method
requires user interaction to draw an initial curve around the object to be detected [33]. The snake should usually
be placed near the boundary of the ROI [82].

Clustering 

Clustering is one of the most useful techniques in MRI segmentation. It classifies pixels into classes
without prior information or training, assigning each pixel to the class to which it most probably belongs.
Clustering techniques train themselves by using the pixel features together with the properties of each class
[49, 51, 55, 70].

K-Means 

The K-means clustering algorithm is one of the simplest unsupervised learning algorithms that solve the clustering
problem. It classifies a given data set into a chosen number of clusters by a simple procedure. First, K centres
are defined, one for each cluster. These centres should be placed as far away from each other as possible. Next,
each point in the data set is associated with its nearest centre. When no point is pending, the first step is
complete and an early grouping is obtained. The second step is to re-calculate K new centroids as the barycentres
of the clusters resulting from the previous step. With these K new centroids, the data points are again bound to
the nearest new centre. This creates a loop in which the K centres change their location step by step until they
no longer move. The algorithm aims at minimizing an objective function known as the squared error function,
given by

$$ J(V) = \sum_{i=1}^{C} \sum_{j=1}^{C_i} \left( \lVert x_i - v_j \rVert \right)^2 \tag{10} $$

where ||x_i - v_j|| is the Euclidean distance between x_i and v_j,

C_i is the number of data points in the ith cluster, and

C is the number of cluster centres.
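
The loop described above can be sketched in a few lines. This is a minimal illustration, assuming random selection of the initial centres from the data (the text only asks that they be far apart); the function name, the parameters `n_iter` and `seed`, and the sample data are hypothetical.

```python
import numpy as np

def kmeans(X, k, n_iter=100, seed=0):
    """Plain K-means: returns labels, centres and the squared error J."""
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)
    # Initial centres: k distinct data points chosen at random (assumption)
    centres = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        # Step 1: bind every point to its nearest centre
        dist = np.linalg.norm(X[:, None, :] - centres[None, :, :], axis=2)
        labels = dist.argmin(axis=1)
        # Step 2: recompute each centre as the barycentre of its cluster
        new_centres = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centres[j]
            for j in range(k)
        ])
        converged = np.allclose(new_centres, centres)
        centres = new_centres
        if converged:               # centres no longer move
            break
    # Squared error: sum of squared distances to the assigned centres
    J = float(((X - centres[labels]) ** 2).sum())
    return labels, centres, J

# Hypothetical 1-D intensities forming two well separated groups
X = np.array([[0.0], [1.0], [10.0], [11.0]])
labels, centres, J = kmeans(X, k=2)
```

For image segmentation, X would typically hold one row per pixel, for example its gray level or a small feature vector.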

The K-means algorithm is fast, robust and easy to understand. It gives good results when the clusters are well
separated from each other. However, if two groups of data overlap heavily, K-means cannot resolve them into two
clusters. Qurat-ul Ain et al. [52] have proposed a robust system for brain tumour diagnosis and tumour region
extraction. The system first diagnoses the tumour from brain MR images by Bayes classification. After the
diagnosis, K-means clustering and boundary detection techniques are applied to extract the exact tumour region.
An accuracy above 99% was achieved for diagnosis, and the experimental results showed that the system extracts
the tumour region accurately.

Fuzzy  C-Means  (FCM) 

FCM clustering is an unsupervised method for data analysis. The algorithm assigns to each data point a membership
in each cluster on the basis of the distance between the cluster centre and the data point. The nearer a data
point is to a cluster centre, the higher its membership in that cluster. The memberships of each data point sum to
one. After each iteration, the cluster centres and memberships are updated according to

$$ v_j = \frac{\sum_{i=1}^{n} (\mu_{ij})^m \, x_i}{\sum_{i=1}^{n} (\mu_{ij})^m}, \quad j = 1, 2, \ldots, c \tag{11} $$

$$ \mu_{ij} = \frac{1}{\sum_{k=1}^{c} \left( d_{ij} / d_{ik} \right)^{2/(m-1)}} \tag{12} $$

where

n is the number of data points,

x_i is the ith d-dimensional measured data point,

v_j is the d-dimensional centre of the jth cluster,

m is the fuzziness index, 1 < m < ∞,

c is the number of cluster centres,

μ_ij is the membership of the ith data point in the jth cluster,

d_ij = ||x_i - v_j|| and d_ik = ||x_i - v_k|| are the Euclidean distances between the ith data point and the jth
and kth cluster centres, and

|| * || is any norm expressing the similarity between a measured data point and a centre.

The main objective of the fuzzy c-means algorithm is to minimize

$$ J(U, V) = \sum_{i=1}^{n} \sum_{j=1}^{c} (\mu_{ij})^m \, \lVert x_i - v_j \rVert^2, \quad 1 < m < \infty \tag{13} $$

where ||x_i - v_j|| is the Euclidean distance between the ith data point and the jth cluster centre.


The FCM algorithm gives good results for overlapping data sets, and better results than the k-means algorithm,
because a data point can belong to more than one cluster. The main drawbacks of FCM are: 1) the membership
values of a data point x_i over all the clusters must sum to one, so outlier points receive relatively high
membership values and the algorithm has difficulty handling them; 2) because every data point influences every
centre, the cluster centres tend to move towards the centre of all the data points [17]. FCM also considers only
image intensity and therefore produces unsatisfactory results on noisy images [24].
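
The alternating updates of cluster centres and memberships can be sketched as below. This is a minimal version of standard FCM with Euclidean distance, offered under assumptions: random initial memberships, a fuzziness index of m = 2 by default, and a stopping rule based on the largest membership change. The function name, the parameters and the sample data are hypothetical.

```python
import numpy as np

def fuzzy_c_means(X, c, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Standard FCM: returns memberships U (n x c), centres V (c x d) and J."""
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)
    U = rng.random((len(X), c))
    U /= U.sum(axis=1, keepdims=True)          # memberships of each point sum to one
    for _ in range(n_iter):
        Um = U ** m
        # Centre update: membership-weighted mean of the data
        V = (Um.T @ X) / Um.sum(axis=0)[:, None]
        d = np.linalg.norm(X[:, None, :] - V[None, :, :], axis=2)
        d = np.maximum(d, 1e-12)               # guard against division by zero
        # Membership update: ratio[i, j, k] = d_ij / d_ik
        ratio = d[:, :, None] / d[:, None, :]
        U_new = 1.0 / (ratio ** (2.0 / (m - 1.0))).sum(axis=2)
        done = np.abs(U_new - U).max() < tol
        U = U_new
        if done:
            break
    J = float(((U ** m) * d ** 2).sum())       # objective at the last iterate
    return U, V, J

# Hypothetical 1-D intensities forming two groups
X = np.array([[0.0], [1.0], [10.0], [11.0]])
U, V, J = fuzzy_c_means(X, c=2)
hard_labels = U.argmax(axis=1)
```

Taking the arg-max of each row of U converts the fuzzy partition into a hard segmentation; the row itself shows how strongly a point is shared between clusters.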

Several algorithms have been proposed to make FCM robust against noise and inhomogeneity, but none is yet
perfect [24, 40, 2, 64, 18, 78]. Many modifications of the standard FCM algorithm have been made to
improve its performance. Each of the modified FCM algorithms proposes a new membership function for calculating
the membership of data points in clusters. In [22], a new distance function based on the dot product is used
instead of the conventional Euclidean distance. The new membership function introduced is given in Eq. (14).


Other improved versions of FCM that modify the objective function were introduced in [3] and [78]. In the IFCM
algorithm proposed in [61], each pixel attempts during clustering to attract its neighbouring pixels towards its
own cluster. The attraction depends on the intensity of the pixel. In MRI brain images, noise alters the pixel
intensity and can therefore affect the segmentation; so, instead of modifying the objective function, the
similarity measure was extended to take neighbourhood attraction into account [58].

Chandra, S. et al. [10] have proposed a clustering algorithm based on Particle Swarm Optimization (PSO). The
algorithm identifies the centroids of a number of clusters, where each cluster groups together brain tumour
patterns obtained from MR images. The results for three performance measures were compared with those obtained
from Support Vector Machine (SVM) and AdaBoost. The performance analysis showed that the qualitative results of
the proposed model are comparable to those obtained by SVM. In addition, different values of the PSO control
parameters were selected in order to obtain better results from the algorithm.

Expectation  Maximization 

A model based tumour segmentation technique was implemented in [44]. This approach uses a modified
Expectation Maximization (EM) algorithm to differentiate healthy from tumorous tissue. The paper presents a set
of tumour characteristics that are essential for accurate segmentation. The drawback of this work is the lack of
quantitative analysis of the extracted tumour region. Applications of the EM algorithm to brain MR image
segmentation were reported in [72] and [37]. A common disadvantage of the EM algorithm is that it models the
intensity distribution of brain images as a normal distribution.

Hierarchical  Clustering 

Hierarchical clustering works by grouping data objects into a tree of clusters. It does not require the number of
clusters to be specified, and it is deterministic. There are two types of hierarchical clustering:
1. agglomerative clustering and 2. divisive clustering.

In agglomerative clustering, each element is initially treated as a single cluster, and clusters are then merged
(agglomerated) until all are merged into a single cluster, which results in a dendrogram [36]. The disadvantages
of this method are: 1) it does not scale well, with a time complexity of at least O(n^2), where n is the total
number of objects; 2) it can never undo what was previously done. Divisive hierarchical clustering is more
efficient than agglomerative hierarchical clustering [36], and it can be stopped as soon as the goal is achieved.

Ant  Tree  Algorithm 

The Ant Tree algorithm [5] was introduced for MRI brain image segmentation. The general Ant-Tree algorithm
cannot be used directly for MRI brain image segmentation [75, 38]. It builds a hierarchical structure
incrementally as the ants join together. Each ant represents a single datum from the data set and moves in the
structure according to its similarity sim(i, j) with the ants already connected to the tree under construction.
To partition the whole data set, a decision is made about where each ant will be connected: either to the main
support (which generates a new cluster) or to another ant (which refines an existing cluster).

In the improved Ant Tree algorithm [38], the tree structure ensures that new ants do not need to search the
entire data set when they are connected to the structure. Usually each sub-tree is connected directly to the
support as a cluster, so the hierarchical structure of the tree contains a large amount of redundant information.
To overcome this problem, a new cluster centre method was introduced in [38] to improve the tree model of the
Ant Tree algorithm. It is shown in [38] that the improved Ant Tree clustering algorithm can produce better
results than k-means and FCM. In addition, its processing speed is much faster than that of the other two
clustering algorithms. However, the algorithm is complex.

The Ant Tree algorithm segments the MR brain image into white matter, gray matter and cerebrospinal fluid
better than the FCM and k-means algorithms [38]. Its processing speed is also better than that of FCM and
k-means, which makes it suitable for segmenting large scale MR brain images quickly.

FEATURE  EXTRACTION 

Representing an image requires a large amount of data, which consumes a large amount of memory and processing
time. To reduce the data, memory and time required, features are extracted from the image. The extracted features
contain the relevant information of the image and can be used as input to a classifier for image classification
and segmentation.

The methods used to extract features from MRI images are 1) Independent Component Analysis, 2) Fourier
Transform and 3) Wavelet Transform. The Fourier transform is used for frequency analysis of an image, and the
wavelet transform is used for time, space and frequency analysis. The types of features that can be extracted
from an image are listed in Table 1 and Table 2.

The  Intensity  Based  Feature 

Intensity based features are among the most widely used features. Intensity features such as mean, median, mode,
skewness, kurtosis, energy and entropy are considered. Let f(x,y) be a two dimensional function of an image, h(i)
be the number of pixels at intensity level i, N_g be the total number of gray levels in the entire image and p(i)
be the probability density. The features that can be extracted are then as follows.


Table 1: List of Intensity Based Features

S.No | Feature | Expression

1 | Variance (σ²) | $\sigma^2 = \sum_{i=0}^{N_g-1} (i - \mu)^2 \, p(i)$  (15)

2 | Mean (μ) | $\mu = \sum_{i=0}^{N_g-1} i \, p(i)$  (16)
  where $p(i) = \frac{h(i)}{N_x N_y}, \quad i = 0, 1, 2, \ldots, N_g-1$  (17)
  $h(i) = \sum_{x=0}^{N_x-1} \sum_{y=0}^{N_y-1} \delta(f(x,y), i), \quad i = 0, 1, 2, \ldots, N_g-1$  (18)
  $\delta(i,j) = \begin{cases} 1, & i = j \\ 0, & i \ne j \end{cases}$  (19)

3 | Skewness (μ_3) | $\mu_3 = \sigma^{-3} \sum_{i=0}^{N_g-1} (i - \mu)^3 \, p(i)$  (20)

4 | Kurtosis (μ_4) | $\mu_4 = \sigma^{-4} \sum_{i=0}^{N_g-1} (i - \mu)^4 \, p(i) - 3$  (21)

5 | Energy (E) | $E = \sum_{i=0}^{N_g-1} [p(i)]^2$  (22)

6 | Entropy (En) | $En = -\sum_{i=0}^{N_g-1} p(i) \log_2 [p(i)]$  (23)

7 | Range (R) | Minimum and maximum values of pixel intensity in an image

8 | Pixel Orientation (PO) | $PO = \tan^{-1} \{ (y - m) / (x - m) \}$  (24)
  where m is the point at which the value is to be measured, and x, y are the points on the x-axis and y-axis
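
The histogram based rows of Table 1 can be computed directly from the normalised histogram p(i), as in the following sketch. The function name, the dictionary keys and the tiny test image are hypothetical; the image is assumed to hold integer gray levels in 0 ... N_g-1, and terms with p(i) = 0 are skipped in the entropy, which is an assumption about how log 0 is handled.

```python
import numpy as np

def intensity_features(image, Ng=256):
    """First-order (histogram) features of an integer-valued gray image."""
    image = np.asarray(image)
    h = np.bincount(image.ravel(), minlength=Ng).astype(float)   # histogram h(i)
    p = h / image.size                                           # p(i) = h(i) / (Nx * Ny)
    i = np.arange(Ng)
    mean = (i * p).sum()
    var = ((i - mean) ** 2 * p).sum()
    sigma = np.sqrt(var)
    # Skewness and kurtosis are undefined for a constant image; return 0 there
    skew = ((i - mean) ** 3 * p).sum() / sigma ** 3 if sigma > 0 else 0.0
    kurt = ((i - mean) ** 4 * p).sum() / sigma ** 4 - 3 if sigma > 0 else 0.0
    nz = p[p > 0]
    return {
        "mean": float(mean),
        "variance": float(var),
        "skewness": float(skew),
        "kurtosis": float(kurt),
        "energy": float((p ** 2).sum()),
        "entropy": float(-(nz * np.log2(nz)).sum()),
        "range": (int(image.min()), int(image.max())),
    }

# Hypothetical 2x2 image with four gray levels available
features = intensity_features(np.array([[0, 0], [2, 2]]), Ng=4)
```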

Texture  Feature 


Texture is a characteristic of an image that provides a higher order description of it and includes information
about the spatial distribution of tonal variations or gray tones. Texture extraction describes the homogeneity or
similarity between regions of an image. Texture features are used to classify a magnetic resonance image of the
brain into gray matter, white matter, cerebrospinal fluid and tumour region. The Haralick texture features [25] are,

Table 2: List of Texture Features

S.No | Feature | Expression

1 | Energy (E) or Angular second moment (ASM) or uniformity | $E = \sum_{i=0}^{N_g-1} \sum_{j=0}^{N_g-1} p(i,j)^2$  (25)

2 | Contrast (Con) | $Con = \sum_{n=0}^{N_g-1} n^2 \left\{ \sum_{i=0}^{N_g-1} \sum_{j=0}^{N_g-1} p(i,j) \right\}, \quad |i - j| = n$  (26)

3 | Correlation (Cor) | $Cor = \frac{\sum_{i=0}^{N_g-1} \sum_{j=0}^{N_g-1} (i j) \, p(i,j) - \mu_x \mu_y}{\sigma_x \sigma_y}$  (27)

4 | Sum of squares: Variance (σ²) | $\sigma^2 = \sum_{i=0}^{N_g-1} \sum_{j=0}^{N_g-1} (i - \mu)^2 \, p(i,j)$  (28)

5 | Inverse Difference Moment (IDM) | $IDM = \sum_{i=0}^{N_g-1} \sum_{j=0}^{N_g-1} \frac{1}{1 + (i - j)^2} \, p(i,j)$  (29)

6 | Sum Average (Mean) | $\mu = \sum_{i=0}^{2(N_g-1)} i \, p_{x+y}(i)$  (30)

7 | Sum Variance (SV) | $SV = \sum_{i=0}^{2(N_g-1)} (i - SE)^2 \, p_{x+y}(i)$  (31)

8 | Sum Entropy (SE) | $SE = -\sum_{i=0}^{2(N_g-1)} p_{x+y}(i) \log p_{x+y}(i)$  (32)

9 | Entropy (En) | $En = -\sum_{i=0}^{N_g-1} \sum_{j=0}^{N_g-1} p(i,j) \log p(i,j)$  (33)

10 | Difference Variance (DV) | $DV = \text{variance of } p_{x-y}$  (34)

11 | Difference Entropy (DE) | $DE = -\sum_{i=0}^{N_g-1} p_{x-y}(i) \log p_{x-y}(i)$  (35)

12 | Inertia (I) | $I = \sum_{i=0}^{N_g-1} \sum_{j=0}^{N_g-1} (i - j)^2 \, p(i,j)$  (36)

13 | Cluster Shade (CS) | $CS = \sum_{i=0}^{N_g-1} \sum_{j=0}^{N_g-1} (i + j - \mu_x - \mu_y)^3 \, p(i,j)$  (37)

14 | Cluster Prominence (CP) | $CP = \sum_{i=0}^{N_g-1} \sum_{j=0}^{N_g-1} (i + j - \mu_x - \mu_y)^4 \, p(i,j)$  (38)
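
A few of the entries of Table 2 are sketched below, taking p(i, j) to be a normalised gray level co-occurrence matrix as in Haralick's formulation. The pixel offset (one step to the right), the symmetric counting of pairs, the function names and the test image are assumptions made for illustration, and zero entries are skipped in the entropy.

```python
import numpy as np

def glcm(image, Ng, dy=0, dx=1):
    """Normalised, symmetric co-occurrence matrix p(i, j) for offset (dy, dx)."""
    image = np.asarray(image)
    h, w = image.shape
    P = np.zeros((Ng, Ng))
    for y in range(h):
        for x in range(w):
            y2, x2 = y + dy, x + dx
            if 0 <= y2 < h and 0 <= x2 < w:
                P[image[y, x], image[y2, x2]] += 1
                P[image[y2, x2], image[y, x]] += 1   # count the pair both ways
    return P / P.sum()

def texture_features(p):
    """Energy, inertia, inverse difference moment and entropy of p(i, j)."""
    Ng = p.shape[0]
    i, j = np.indices((Ng, Ng))
    nz = p[p > 0]
    return {
        "energy": float((p ** 2).sum()),                   # Eq. (25)
        "inertia": float(((i - j) ** 2 * p).sum()),        # Eq. (36)
        "idm": float((p / (1.0 + (i - j) ** 2)).sum()),    # Eq. (29)
        "entropy": float(-(nz * np.log(nz)).sum()),        # Eq. (33)
    }

# Hypothetical 2x2 image with two gray levels
p = glcm(np.array([[0, 0], [1, 1]]), Ng=2)
texture = texture_features(p)
```

Grouping the terms of the inertia sum by n = |i - j| gives the contrast of Eq. (26), so the two values coincide.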

Shape  Features 


Shape provides the geometrical information of an object in an image. Shape features such as centroid,
eccentricity, area, perimeter, circularity, shape index, solidity, orientation and Euler number are considered.

CLASSIFICATION  TECHNIQUES 

Various classification methods have been reported in the literature. Some of the commonly used methods are
summarized in Figure 2.

KNN


The K-Nearest Neighbour (k-NN) classification technique is the simplest technique that provides good
classification accuracy [71]. The k-NN algorithm is based on a distance function and a voting function over the
k nearest neighbours; the metric employed is the Euclidean distance [21]. k-NN has higher accuracy and stability
for MRI data than other common statistical classifiers, but it has a slow running time [15].
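
The distance function and voting function can be sketched as follows. The feature vectors, the class names and the choice k = 3 are hypothetical and serve only to show the mechanics.

```python
from collections import Counter
import numpy as np

def knn_predict(X_train, y_train, x, k=3):
    """Classify x by majority vote among its k nearest training samples."""
    X_train = np.asarray(X_train, dtype=float)
    # Distance function: Euclidean distance to every training sample
    d = np.linalg.norm(X_train - np.asarray(x, dtype=float), axis=1)
    nearest = np.argsort(d)[:k]
    # Voting function: the most frequent label among the k nearest wins
    votes = Counter(y_train[i] for i in nearest)
    return votes.most_common(1)[0][0]

# Hypothetical two-feature training set
X_train = [[0, 0], [0, 1], [5, 5], [6, 5]]
y_train = ["normal", "normal", "abnormal", "abnormal"]
prediction_a = knn_predict(X_train, y_train, [0.2, 0.3])
prediction_b = knn_predict(X_train, y_train, [5.5, 5.0])
```

Every prediction computes the distance to all training samples, which is the source of the slow running time noted above.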


[Figure 2 diagram. Classifiers (supervised methods): Support Vector Machine, Principal Component Analysis.
Artificial Neural Network: Feed Forward Network (Single Layer Perceptron, Multi Layer Perceptron);
Feed Back Network (Kohonen's SOM, Competitive Network).]

Figure 2: Types of Classifiers Used in MRI Brain Image Classification


SVM 


Support Vector Machine (SVM) is a supervised classifier with an associated learning algorithm derived from statistical
learning theory [65, 66]. It is mainly used for the classification of MRI brain images as normal or abnormal because of its
computational efficiency and good performance. It was first developed as an extension of the Generalized Portrait
algorithm [67, 68]. It works on the principle of structural risk minimization from statistical learning theory. The SVM
classifier is based on the hyperplane that maximizes the separating margin between the two classes.

The basic SVM takes a set of input data and predicts, for each given input, which of two possible classes forms
the output, making it a non-probabilistic binary linear classifier. In addition to performing linear classification, SVMs can
efficiently perform non-linear classification using the kernel trick, implicitly mapping their inputs into high-dimensional
feature spaces. To maximize the margin between the classes [80], the kernel is used to control the empirical risk and
classification. There are many kernel functions, such as the linear kernel, the polynomial kernel of a given degree and the
radial basis function (RBF) kernel. Among these, the radial basis function is reported to perform better for MRI brain
images [80, 84]. In contrast, Sumesh et al. [85] reported that an SVM with a polynomial kernel of degree 3 gives better
results than one with a linear or RBF kernel, and that level-based normal-abnormal classification gives better results than
non-level-based classification.
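For reference, the three kernel functions named above are commonly written as in the following sketch. The parameter names `degree`, `coef0` and `gamma` and their default values are illustrative assumptions, not settings taken from the cited studies.

```python
import math


def dot(x, z):
    return sum(a * b for a, b in zip(x, z))


def linear_kernel(x, z):
    """K(x, z) = x . z"""
    return dot(x, z)


def polynomial_kernel(x, z, degree=3, coef0=1.0):
    """K(x, z) = (x . z + coef0) ** degree"""
    return (dot(x, z) + coef0) ** degree


def rbf_kernel(x, z, gamma=0.5):
    """K(x, z) = exp(-gamma * ||x - z||^2)"""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, z))
    return math.exp(-gamma * squared_distance)


# Hypothetical feature vectors.
x, z = (1.0, 2.0), (3.0, 4.0)
k_lin = linear_kernel(x, z)
k_poly = polynomial_kernel(x, z)
k_rbf = rbf_kernel(x, z)
```

Each kernel returns the inner product of the two inputs in some feature space without constructing that space explicitly, which is what makes the kernel trick efficient.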

SVM has two stages: a training stage and a testing stage. SVM trains itself on features given as input to its learning
algorithm. During training, SVM selects suitable margins between the two classes. Features are labeled according to the
class with which they are associated. SVM is an attractive and systematic method for two-class problems. Many authors
[1, 11, 12, 34, 47] classify MRI brain images into two separate classes, normal and abnormal, using a Support Vector
Machine. Support Vector Machine based classification of various levels of MR glioma images has also been
performed [77]. This method is claimed to be better than rule-based systems, but its accuracy is low.
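The two stages can be illustrated with a small linear SVM trained by sub-gradient descent on the regularized hinge loss. This is one simple way of fitting a maximum-margin hyperplane and is not necessarily the solver used in the cited works; the learning rate, the regularization constant, the function names and the feature values are all assumptions made for illustration.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def train_linear_svm(samples, labels, lr=0.01, lam=0.01, epochs=200):
    """Training stage: sub-gradient descent on the regularized hinge loss."""
    w = [0.0] * len(samples[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            margin = y * (dot(w, x) + b)
            # The regularization term shrinks w, which favours a wide margin.
            w = [wi * (1.0 - lr * lam) for wi in w]
            if margin < 1.0:
                # Sample lies inside the margin or is misclassified:
                # move the hyperplane towards classifying it correctly.
                w = [wi + lr * y * xi for wi, xi in zip(w, x)]
                b += lr * y
    return w, b


def predict(w, b, x):
    """Testing stage: the sign of the decision function gives the class."""
    return 1 if dot(w, x) + b >= 0 else -1


# Hypothetical, already centred feature vectors: -1 = normal, +1 = abnormal.
features = [(-2.0, -2.0), (-1.5, -2.5), (-2.0, -1.0),
            (2.0, 2.0), (2.5, 1.5), (1.0, 2.0)]
classes = [-1, -1, -1, 1, 1, 1]

w, b = train_linear_svm(features, classes)
new_label = predict(w, b, (1.9, 2.1))
```

Only samples whose margin falls below 1 change the hyperplane, so the final boundary is determined by the samples closest to it.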

Artificial  Neural  Network 

Neural network based segmentation is totally different from conventional segmentation algorithms. An image is
first mapped onto a neural network in which every neuron stands for a pixel [32, 48]; the image segmentation problem is
thus converted into an energy minimization problem. The neural network is trained with a training sample set in order
to determine the connections and weights between nodes. Neural network segmentation includes two important steps:
feature extraction and image segmentation based on the neural network. Feature extraction is crucial because it determines
the input data of the neural network [81]. First, features are extracted from the images such that they are suitable for
segmentation, and they then become the input of the neural network. All of the selected features compose a highly
non-linear feature space of cluster boundaries.

Neural network based segmentation has three basic characteristics:

•  High parallel ability and fast computing [32].

•  Improved segmentation results when the data deviates from the normal situation [81].

•  Reduced requirement of expert intervention during the image segmentation process.

However, neural network based segmentation has some drawbacks:

•  Some kind of segmentation information should be known beforehand.

•  The neural network should be trained using a learning process beforehand [32].

•  The training period may be very long, and overtraining should be avoided at the same time.

Based on the architecture, artificial neural networks can be grouped into two categories:

•  Feed-Forward Network

•  Feed-Backward Network or Recurrent Network

In a feed-forward network, the neurons are arranged in layers that have unidirectional connections between them.
Feed-forward networks produce only one set of output values. Such a network is called a static network because the output
values do not depend on previous output values; they are produced based only on the current input. A feed-forward network
is also called a memoryless network. In a feedback network, the neurons are arranged in layers that have bidirectional
connections between them [55, 82]. Feedback networks produce a set of values that is updated based on the output
values fed back. A feedback network is also known as a dynamic network because the output values depend on the previous
state values.
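The difference between a static and a dynamic network can be seen in a single-neuron sketch. The weights, the feedback weight and the class name `FeedbackNeuron` are hypothetical and chosen only to make the behaviour visible.

```python
import math


def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))


def feed_forward_neuron(x, weights):
    """Static (memoryless): the output depends only on the current input."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)))


class FeedbackNeuron:
    """Dynamic: the previous output is fed back as an extra input."""

    def __init__(self, weights, feedback_weight):
        self.weights = weights
        self.feedback_weight = feedback_weight
        self.state = 0.0  # previous output value

    def step(self, x):
        a = sum(w * xi for w, xi in zip(self.weights, x))
        a += self.feedback_weight * self.state
        self.state = sigmoid(a)
        return self.state


x = (1.0, 2.0)
weights = (0.5, -0.3)

# The same input always gives the same output in the feed-forward case.
ff_first = feed_forward_neuron(x, weights)
ff_second = feed_forward_neuron(x, weights)

# The same input gives different outputs in the feedback case,
# because the stored state changes between calls.
neuron = FeedbackNeuron(weights, feedback_weight=0.8)
fb_first = neuron.step(x)
fb_second = neuron.step(x)
```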

Back  Propagation  Algorithm 

The Back Propagation algorithm [27, 53] is used in layered feed-forward ANNs. In this network, the neurons are
organized in layers and send their signals in the forward direction, while the errors generated are propagated in the
backward direction. The network receives its input through the neurons in the input layer, and the output of the network
is given by the neurons in the output layer. The network contains one or more intermediate hidden layers. Back Propagation
is a supervised learning algorithm. The network is initialized with random weights, and the weights are then adjusted to
minimize the error. The network is considered ideal when the error is minimal. In an ANN, the activation function of the
artificial neuron is a weighted sum.

\[ A_j(x, w) = \sum_{i=0}^{n} x_i w_{ji} \tag{39} \]

The most commonly used output function, the sigmoid function, is given as

\[ O_j(x, w) = \frac{1}{1 + e^{-A_j(x, w)}} \tag{40} \]
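Equations (39) and (40), together with the weight adjustment described above, can be sketched for a single neuron. With only one neuron there are no hidden layers, so the backward propagation of errors reduces to a single gradient step per sample. The sketch assumes a squared-error function E = 0.5 (O - d)^2, which is a common choice but is an assumption here; the learning rate, the number of epochs and the logical OR data are also illustrative.

```python
import math
import random


def activation(x, w):
    """Equation (39): weighted sum of the inputs."""
    return sum(xi * wi for xi, wi in zip(x, w))


def output(x, w):
    """Equation (40): sigmoid of the activation."""
    return 1.0 / (1.0 + math.exp(-activation(x, w)))


def train_neuron(samples, targets, lr=0.5, epochs=2000, seed=0):
    rng = random.Random(seed)
    # Start from small random weights, then adjust them to reduce the error.
    w = [rng.uniform(-0.5, 0.5) for _ in samples[0]]
    for _ in range(epochs):
        for x, d in zip(samples, targets):
            o = output(x, w)
            # Gradient of the assumed error 0.5 * (o - d)**2 with respect to
            # the activation, using the sigmoid derivative o * (1 - o).
            delta = (o - d) * o * (1.0 - o)
            w = [wi - lr * delta * xi for wi, xi in zip(w, x)]
    return w


# Logical OR; the first input is fixed at 1 so that w[0] acts as a bias.
samples = [(1, 0, 0), (1, 0, 1), (1, 1, 0), (1, 1, 1)]
targets = [0, 1, 1, 1]
weights = train_neuron(samples, targets)
outputs = [output(x, weights) for x in samples]
```

In a multi-layer network the same delta term is computed at the output layer and then passed backwards through the weights to obtain the corrections for the hidden layers.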


The sigmoid function approaches 1 for large positive inputs, equals 0.5 at zero, and approaches 0 for large negative
inputs. The error function for the output of each neuron is given by equation (41).


A maximum likelihood method was compared with a neural network technique for tissue classification [16]. In
this study, the neural network provided better boundary definition than maximum likelihood. Hall et al. [24] compared
MR image segmentation techniques based on supervised multilayered neural networks [53] and the unsupervised fuzzy
C-means algorithm [7]. These segmentation techniques were tested on MR images from healthy volunteers and selected
patients with brain tumours surrounded by edema. The supervised and unsupervised segmentation techniques used in this
study produced broadly similar results.

Badran  et  al.  [6]  have  proposed  a  computer-based  technique  for  identifying  the  tumor  region  accurately  in  the 
brain  via  MRI  images.  Here,  the  classification  has  been  performed  on  a  brain  tumor  image  for  identifying  whether  the 
tumor  is  a  benign  or  malignant  one.  The  steps  involved  in  the  proposed  algorithm  were  preprocessing,  image  segmentation, 
feature  extraction  and  image  classification  via  neural  network  techniques.  Finally,  using  the  region  of  interest  technique, 
the  tumor  area  has  been  located. 

R. Mishra [43] developed a system in which brain tumors are diagnosed with higher accuracy using an
artificial neural network. After features are extracted from the MRI data by means of wavelet packets,
an artificial neural network is employed to distinguish normal from abnormal spectra. Wavelet packets give
a richer analysis than wavelet transforms, which adds to the performance of the proposed system.

MR image segmentation based on feed-forward neural networks relies heavily on the training set used for their
supervised training. The training set is constructed by selecting feature vectors from a single MR image or an ensemble of
MR images [26]. Many researchers have shown that a multi-layer network with 3 layers can perform arbitrarily complex
classification.


A self-organizing map (SOM) is an unsupervised clustering network that maps inputs, which can be high
dimensional, onto a one- or two-dimensional discrete lattice of neuron units [63]. Input data are classified according to
their grouping in the input space and the neighbouring neurons. The SOM consists of two layers: an input layer and a
competitive layer. The number of neurons in the first layer is equal to the dimension of the input, and the number of
neurons in the second layer is equal to the number of classes or clusters. Each connection from the input layer to a neuron
in the competitive layer is assigned a weight factor. The SOM functions in two steps: first, finding the winning neuron,
i.e. the neuron most similar to the input according to a similarity measure such as Euclidean distance; and second,
updating the weights of the winning neuron and its neighbouring neurons based on the input.
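The two steps can be sketched for a one-dimensional lattice as follows. The update w = w + lr (x - w), the fixed neighbourhood radius, the constant learning rate and the initial weights are assumptions made to keep the example small; practical SOMs usually shrink both the radius and the learning rate during training.

```python
import math


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def find_winner(weights, x):
    """Step 1: index of the neuron whose weight vector is closest to x."""
    distances = [euclidean(w, x) for w in weights]
    return distances.index(min(distances))


def som_update(weights, x, lr=0.5, radius=1):
    """Step 2: move the winner and its lattice neighbours towards x."""
    winner = find_winner(weights, x)
    for k in range(len(weights)):
        if abs(k - winner) <= radius:  # neighbourhood on a 1-D lattice
            weights[k] = [w + lr * (xi - w) for w, xi in zip(weights[k], x)]
    return winner


# Four competitive-layer neurons, two-dimensional input (hypothetical values).
weights = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0], [9.0, 9.0]]
winner = som_update(weights, [4.0, 4.0])
```

After the update, the winning neuron and its two lattice neighbours have moved halfway towards the input, while the neuron outside the neighbourhood is unchanged.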

The well-known unsupervised approach developed by [35] uses a linear update rule for the weights. This network is
used for classification of tissues in MR brain images. Chou et al. [13] previously introduced Kohonen's SOM for
segmentation of dual-echo MR images. When Chou et al. [13] used Kohonen's SOM, only normal tissues (white
matter, gray matter and cerebrospinal fluid) were classified in the MR brain images. Their technique was not able to
classify the abnormal tissue in the brain images. To overcome this drawback, Javad et al. [4] classified the unknown tissue
as a tumour.

A new unsupervised MRI segmentation method based on a self-organizing feature map was presented by Yan Li
and Zheru Chi [76]. Their algorithm included extra spatial information about a pixel region by using a Markov Random
Field (MRF) model. The MRF term improved the segmentation results without extra data samples in the training set. The
incorporation of the MRF into the SOM has shown great potential, as the MRF term models the smoothness of the segmented
regions: neighbouring pixels should have similar segmentation assignments unless they are on the
boundary of two distinct regions.

Hybrid  Techniques 

Several hybrid neuro-fuzzy approaches for MRI brain image analysis are reported in the literature. A
combined approach of SOM, SVM and fuzzy theory implemented by [30] performed better than
other segmentation techniques. SOM has been combined with FCM for brain image segmentation [45], but this technique is
not suitable for tumours of varying size and its convergence rate is very low.

A hybrid approach combining wavelets and a support vector machine (SVM) for classifying
abnormal and normal images is used by [11]. This report revealed that the hybrid SVM is better than Kohonen
neural networks in terms of performance measures. However, the major drawback of this system is the small size of the
dataset used for implementation; the classification accuracy may fall when the size of the dataset is increased. A
modification of the conventional SVM, the least squares SVM, is proposed for brain tumour recognition by [41]. A hybrid
approach for pattern classification that combines SVM and fuzzy rules is reported by [39].
The results revealed that the proposed hybrid approach is accurate, fast and robust.

Mehdi Jafari and Shohreh Kasari [84] combined an automatic region growing method with a neural network classifier to
segment and classify the brain image. Their proposed method achieved 100% sensitivity and 96% specificity. Eltaher
Mohammed Hussein et al. [85] showed that the Elman network gives better results than BPNN and RNN
networks.

Venkateswara Reddy et al. [86] proposed a method combining modified FCM and SVM for MRI image
segmentation and classification. They reported that their method produced better results in detecting tumours than
the other combinations KFCM+NN, FCM+NN and FCM+SVM.

CONCLUSIONS 

Many image segmentation methods and classifiers have been developed in the past several decades for
segmenting MRI brain images and classifying them as normal or abnormal. The survey shows that a BPN classifier with
an increased number of nodes gives fast and accurate classification that can be effectively used for segmenting MRI brain
images with a high level of accuracy, and that the log-sigmoid function is the best activation function for medical image
recognition applications. The survey also shows that the Elman network gives better results than BPNN and RNN networks,
and that SVM can classify brain images as normal or abnormal more accurately than the SOM neural network. Finally, it is
concluded that even though several hybrid methods have been developed, segmentation remains a challenging task. Each
hybrid method tries to prove that it is superior to the other existing hybrid methods, so it is very hard to single out the
superior method. In this work, the merits and demerits of various automated techniques for brain tumour identification
are analyzed in detail.